A novel energy distribution comparison approach for robust speech spectrum vector quantization
نویسندگان
چکیده
Vector Quantization (VQ) has been extensively used in speech vocoders. The training process normally requires a very large training-set. This paper introduces a novel energy distribution comparison distortion measure for the high-band speech spectrum that enables the vector quantizer to operate given a relatively small training-set. This measure has been used in the construction of a segmental vocoder using the pitch period as segments. A description of the proposed approach, the Energy-Mass distortion measure, is given and compared to the use of MFCC as a distortion measure showing the ability of the proposed approach to better represent the speech formants, when operating under the small training-set constraint. Finally, the performance of the new Energy-Mass is evaluated using the Spectral Distortion (SD). Speech quality perceived by the receiver is evaluated using the recently standardized objective quality measure PESQ, where an improvement of 0.3 PESQ score was obtained.
منابع مشابه
Robust speech recognition using missing feature theory and vector quantization
This paper addresses the problem of speech recognition in noisy conditions when low complexity is required like in embedded systems. In such systems, vector quantization is generally used to reduce the complexity of the recognition systems (e.g. HMMs). A novel approach for vector quantization based on the missing data theory is proposed. This approach allows to increase the robustness of the sy...
متن کاملRobust speech mode based LSF vector quantization for low bit rate coders
Robust vector quantization of LSF parameters at a low bit rate is essential for voice coders operating below 5 Kbps. A novel aspect of the proposed technique is the use of decorrelated residual LSF vectors from speech mode based backward prediction along with a multi-stage VQ design. Rates as low as 12 bits per 20 ms speech frame for the stationary voiced speech mode and 22 bits/frame for unvoi...
متن کاملUsing FFI Interpolator and VQ Quantization for Designing of High Quality 1200 BPS Speech Vocoder
Storaging or transmission of speech signals at very low bit rate is a hot area in the field of speech processing. We used stochastic inter-frame interpolators and vector quantization (VQ) as a new method for developing a high quality 1200 BPS speech vocoder. The objective and subjecgtive test results show that performance of the new vocoder is compairable with 4800 BPS standard vocoders (as CELP).
متن کاملUsing FFI Interpolator and VQ Quantization for Designing of High Quality 1200 BPS Speech Vocoder
Storaging or transmission of speech signals at very low bit rate is a hot area in the field of speech processing. We used stochastic inter-frame interpolators and vector quantization (VQ) as a new method for developing a high quality 1200 BPS speech vocoder. The objective and subjecgtive test results show that performance of the new vocoder is compairable with 4800 BPS standard vocoders (as CELP).
متن کاملImproving the performance of MFCC for Persian robust speech recognition
The Mel Frequency cepstral coefficients are the most widely used feature in speech recognition but they are very sensitive to noise. In this paper to achieve a satisfactorily performance in Automatic Speech Recognition (ASR) applications we introduce a noise robust new set of MFCC vector estimated through following steps. First, spectral mean normalization is a pre-processing which applies to t...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2007